CHAPTER 16 Getting Straight Talk on Straight-Line Regression 223

As stated at the beginning of this section, you calculate the residual for each point

by subtracting the predicted Y from the observed Y. As shown in Figure 16-6, a

residuals versus fitted graph displays the values of the residuals plotted along the Y

axis and the predicted Y values from the fitted straight line plotted along the X

axis. A normal Q-Q graph shows the standardized residuals, which are the residuals

divided by the RMS value, along the Y axis, and theoretical quantiles along the X

axis. Theoretical quantiles are what you’d expect the standardized residuals to be if

they were exactly normally distributed.

Together, the two graphs shown in Figure 16-6 provide insight into whether your

data conforms to the requirements for straight-line regression:»

» Your data must lie above and below the line randomly across the whole

range of data.»

» The average amount of scatter must be fairly constant across the whole range

of data.»

» The residuals should be approximately normally distributed.

You need years of experience examining residual plots before you can interpret

them confidently, so don’t feel discouraged if you can’t tell whether your data

complies with the requirements for straight-line regression from these graphs.

Here’s how we interpret them, allowing for other biostatisticians to disagree:»

» We read the residuals versus fitted chart in Figure 16-6 to show the points

lying equally above and below the fitted line, because this appears true

whether you’re looking at the left, middle, or right part of the graph.

FIGURE 16-6:

The residuals

versus fitted (a)

and normal (b)

Q-Q graphs help

you determine

whether your

data meets the

requirements for

straight-line

regression.

© John Wiley & Sons, Inc.